Sparsity vs. Large Margins for Linear Classifiers
Authors
Abstract
We provide small sample size bounds on the generalisation error of linear classifiers that take advantage of large observed margins on the training set and sparsity in the data-dependent expansion coefficients. It is already known from results in the luckiness framework that both criteria independently have a large impact on the generalisation error. Our new results show that they can be combined, which theoretically justifies learning algorithms like the Support Vector Machine [4] or the Relevance Vector Machine [12]. In contrast to previous studies we avoid the classical technique of symmetrisation by a ghost sample and instead use the sparsity directly for the estimation of the generalisation error. We demonstrate that our result leads to practically useful bounds even in the case of small sample sizes, provided the training set witnesses our prior belief in sparsity and large margins.
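The two "luckiness" quantities the abstract refers to, the observed margin and the sparsity of the data-dependent expansion coefficients, can be made concrete with a toy sketch. The code below is illustrative only (the data, the kernel-perceptron trainer, and all names are my own, not from the paper): it trains a linear classifier in dual form, w = Σ_i α_i y_i x_i, then reports the normalised margin and how many expansion coefficients α_i are non-zero.

```python
import numpy as np

# Hypothetical toy data: two linearly separable clusters (not from the paper).
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(-2, 0.5, (20, 2)), rng.normal(2, 0.5, (20, 2))])
y = np.hstack([-np.ones(20), np.ones(20)])

# Perceptron in dual form: w = sum_i alpha_i * y_i * x_i, so the expansion
# coefficients alpha are data dependent and typically sparse (most stay zero).
alpha = np.zeros(len(X))
for _ in range(100):
    mistakes = 0
    for i in range(len(X)):
        w = (alpha * y) @ X
        if y[i] * (w @ X[i]) <= 0:
            alpha[i] += 1
            mistakes += 1
    if mistakes == 0:          # converged: every point classified correctly
        break

w = (alpha * y) @ X
margin = np.min(y * (X @ w)) / np.linalg.norm(w)  # observed margin on the sample
sparsity = int(np.count_nonzero(alpha))           # non-zero expansion coefficients

print(f"observed margin = {margin:.3f}, non-zero coefficients = {sparsity}/{len(X)}")
```

On well-separated data both favourable events occur at once: the margin is strictly positive and only a few α_i are non-zero, which is exactly the regime in which the combined bound of the paper is claimed to be tight.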
Similar Resources
Upper Bounds for Error Rates of Linear Combinations of Classifiers
A useful notion of weak dependence between many classifiers constructed with the same training data is introduced. It is shown that if both this weak dependence is low and the expected margins are large, then decision rules based on linear combinations of these classifiers can achieve error rates that decrease exponentially fast. Empirical results with randomized trees and trees constructed via...
The Perceptron Algorithm with Uneven Margins
The perceptron algorithm with margins is a simple, fast and effective learning algorithm for linear classifiers; it produces decision hyperplanes within some constant ratio of the maximal margin. In this paper we study this algorithm and a new variant: the perceptron algorithm with uneven margins, tailored for document categorisation problems (i.e. problems where classes are highly unbalanced a...
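The uneven-margins idea described above can be sketched in a few lines. This is my own illustrative reconstruction, not the paper's implementation: positive and negative examples are required to clear different margin thresholds (tau_pos and tau_neg, names mine), so a rare positive class can be given a wider safety band.

```python
import numpy as np

def uneven_margin_perceptron(X, y, tau_pos=1.0, tau_neg=0.1, lr=1.0, epochs=100):
    """Perceptron-style training where the update is triggered whenever an
    example fails to clear its class-specific margin threshold."""
    w = np.zeros(X.shape[1])
    for _ in range(epochs):
        updated = False
        for xi, yi in zip(X, y):
            tau = tau_pos if yi > 0 else tau_neg   # uneven margins per class
            if yi * (w @ xi) <= tau:
                w += lr * yi * xi
                updated = True
        if not updated:   # every example clears its margin: stop early
            break
    return w

# Hypothetical unbalanced toy problem: 30 negatives, only 5 positives.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(-2, 0.4, (30, 2)), rng.normal(2, 0.4, (5, 2))])
y = np.hstack([-np.ones(30), np.ones(5)])

w = uneven_margin_perceptron(X, y)
print("all training points correct:", bool(np.all(np.sign(X @ w) == y)))
```

Setting tau_pos larger than tau_neg pushes the decision hyperplane away from the minority positive class, which is the motivation the abstract gives for document categorisation with highly unbalanced classes.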
Generalized Sparse Regularization with Application to fMRI Brain Decoding
Many current medical image analysis problems involve learning thousands or even millions of model parameters from extremely few samples. Employing sparse models provides an effective means for handling the curse of dimensionality, but other propitious properties beyond sparsity are typically not modeled. In this paper, we propose a simple approach, generalized sparse regularization (GSR), for i...
On Security and Sparsity of Linear Classifiers for Adversarial Settings
Machine-learning techniques are widely used in security-related applications, like spam and malware detection. However, in such settings, they have been shown to be vulnerable to adversarial attacks, including the deliberate manipulation of data at test time to evade detection. In this work, we focus on the vulnerability of linear classifiers to evasion attacks. This can be considered a relevan...
Journal:
Volume, Issue:
Pages: -
Publication date: 2000